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Abstract 

We present a complete solution to the so-called tennis ball problem, which is equiv- 
alent to counting lattice paths in the plane that use North and East steps and lie 
between certain boundaries. The solution takes the form of explicit expressions for the 
corresponding generating functions. 

Our method is based on the properties of Tutte polynomials of matroids associated 
to lattice paths. We also show how the same method provides a solution to a wide 
generalization of the problem. 

1 Introduction 

The statement of the tennis ball problem is the following. There are 2n balls numbered 
1, 2, 3, ... , 2n. In the first turn balls 1 and 2 are put into a basket and one of them is 
removed. In the second turn balls 3 and 4 are put into the basket and one of the three 
remaining balls is removed. Next balls 5 and 6 go in and one of the four remaining balls is 
removed. The game is played n turns and at the end there are exactly n balls outside the 
basket. The question is how many different sets of balls may we have at the end outside 
the basket. 

It is easy to reformulate the problem in terms of lattice paths in the plane that use steps 
E = (1, 0) and N = (0, 1). It amounts to counting the number of lattice paths from (0, 0) to 
(n, n) that never go above the path NE ■ ■ ■ NE = (NE)^. Indeed, if tt = 7ri7r2 . . . iT2n-i'^2n 
is such a path, a moment's thought shows that we can identify the indices i such that 
Tr2n-i+i is a N step with the labels of balls that end up outside the basket. The number of 
such paths is well-known to be a Catalan number, and this is the answer obtained in [2]. 

The problem can be generalized as follows [3]. We are given positive integers t < s and 
sn labelled balls. In the first turn balls 1, . . . , s go into the basket and t of them are removed. 
In the second turn balls s -|- 1, . . . , 2s go into the basket and t among the remaining ones 
are removed. After n turns, tn balls lie outside the basket, and again the question is how 
many different sets of balls may we have at the end. Letting k = t,l = s — t, the problem 
is seen as before to be equivalent to counting the number of lattice paths from (0, 0) to 
{In, kn) that use N and E steps and never go above the path N^E^ ■ ■ ■ N^E^ = {N'^E'-)'^. 
This is the version of the problem we solve in this paper. 
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From now on we concentrate on lattice paths that use N and E steps. To our knowledge, 
the only cases solved so far are k = 1 and k = I = 2. The case = 1 is straightforward, the 
answer being a generalized Catalan number . The case k = I = 2 (corresponding 

to the original problem when s = 4, i = 2) is solved in using recurrence equations; here 
we include a direct solution. This case is illustrated in Fig. ^ to which we refer next. 
A path vr not above {N'^E'^)^ is "almost" a Catalan path, in the sense that it can raise 
above the dashed diagonal line only through the dotted points. But clearly between two 
consecutive dotted points hit by vr we must have an E step, followed by a Catalan path of 
odd semilength, followed by a step. Thus, vr is essentially a sequence of Catalan paths 
of odd semilength. If G{z) = ^nT^^Cn)-^"^ generating function for the Catalan 

numbers, take the odd part Go{z) = {G{z) — G{—z))/2. Then expand 1/(1 — zGo{z)) to 
obtain the sequence 1, 6, 53, 554, 6363, . . . , which agrees with the results in 
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Figure 1: The path vr = EENNNEEEENNNNNEE not above P = {N'^E'^Y. It has 
2(vr) = 3 and e(vr) = 2, corresponding to the steps underlined. 



Let P be a lattice path from (0, 0) to (m, r), and let h{P) be the number of paths from 
(0, 0) to (m, r) that never go above P. If PN denotes the path obtained from P by adding 
a N step at the end of P, then clearly h{P) = h{PN). However, it is not possible to express 
b{PE) simply in terms of b{P), where PE has the obvious meaning. As is often the case 
in counting problems, one has to enrich the objects under enumeration with additional 
parameters that allow suitable recursive decompositions. This is precisely what is done 
here: equations @ and © in the next section contain variables x and y, corresponding to 
two parameters that we define on lattice paths not above a given path P. These equations 
are the key to our solution. 

The basis of our approach is the connection between lattice paths and matroids es- 
tablished in where the link with the tennis ball problem was already remarked. For 
completeness, we recall the basic facts needed from 1 in the next section. In Section |21 we 
present our solution to the tennis ball problem, in the form of explicit expressions for the 
corresponding generating functions; see Theorem ^ In Section |3 we show how the same 
method can be applied to a more general problem. We conclude with some remarks. 
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2 Preliminaries 



The contents of this section are taken mainly from where the reader can find additional 
background and references on matroids, Tutte polynomials and lattice path enumeration. 

A matroid is a pair (E, B) consisting of a finite set E and a nonempty collection B of 
subsets of E, called bases of the matroid, that satisfy the following conditions: (1) No set 
in B properly contains another set in B, and (2) for each pair of distinct sets B,B' in B 
and for each element x ^ B — B', there is an element y G B' — B such that {B — x) Uy is 
in B. 

Let P be a lattice path from (0, 0) to (m, r). Associated to P there is a matroid Af [P] 
on the set {1, 2, . . . , m + r} whose bases are in one-to-one correspondence with the paths 
from (0, 0) to (m, r) that never go above P. Given such a path vr = 7ri7r2 . . . iTm+r, the basis 
corresponding to tt consists of the indices i such that vrj is a step. Hence, counting bases 
of M[P] is the same as counting lattice paths that never go above P. 

For any matroid M there is a two-variable polynomial with non-negative integer coeffi- 
cients, the Tutte polynomial t{M; x, y). It was introduced by Tutte jBj and presently plays 
an important role in combinatorics and related areas (see (Bj). The key property in this 
context is that t{M; 1, 1) equals the number of bases of M. 

Given a path P as above, there is a direct combinatorial interpretation of the coefficients 
of t{M[P\;x, y). For a path vr not above P, let i(7r) be the number of A^ steps that vr has in 
common with P, and let e(7r) be the number of E steps of vr before the first A^ step, which 
is if vr starts with a A^ step. See Fig. ^for an illustration. 

Then we have (see ^ Th. 5.4]) 

t(M[P];x,y) = J^x^W/W, (1) 

where the sum is over all paths tt not above P. A direct consequence is that t(M[P]; 1, 1) 
is the number of such paths. 

Furthermore, for the matroids M[P] there is a rule for computing the Tutte polynomial 
that we use repeatedly (see [H Section 6]). If PA^ and PE denote the paths obtained from 
P by adding a A^ step and an E step at the end of P, respectively, then 

t{M[PN];x,y) = x t(M[P], x, y), (2) 
t{M[PE];x,y) = ^^t{M[P\,x,y) + {y - t{M[P\-l,y). (3) 

The right-hand side of (jSJ is actually a polynomial, since x — 1 divides t(M[P];x,?/) — 
t{M[P]; The key observation here is that we cannot simply set x = y = 1 in to 

obtain an equation linking t{M[PE]] 1, 1) and t{M[P]] 1, 1). 

For those familiar with matroid theory, we remark that M[PA^] and M[PE] are obtained 
from M[P], respectively, by adding an isthmus and taking a free extension; it is known that 
formulas (HJ and Q correspond precisely to the effect these two operations have on the 
Tutte polynomial of an arbitrary matroid. 

From and the definition of i(7r) and e(7r), equation Q is clear, since any path 
associated to M[PA'] has to use the last A^ step. For completeness, we include a direct 
proof of equation (j^J . 
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We first rewrite the right-hand side of @ as 



^—^{t{M[P]-x,y)-t{M[P]-l,y))+yt{M[P]-l,y) = 

_ 1) + y-W + l = 

TT 
TT 

where the sums are taken over all paths vr that do not go above P. 

To prove the formula, for each path tt not above P we find i(7r) + 1 paths not above PE 
such that their contribution to t(M[P£']; x, y) is y^^^\y + x + + • • • + x*^'^)). Consider 
first the path ttq = En; it clearly does not go above PE and its contribution to the Tutte 
polynomial is Now for each j with 1 < j < i(vr), define the path ttj as the path 

obtained from tt by inserting an E step after the jth N step that tt has in common with 
P (see Fig. |2I). The path irj has exactly j N steps in common with PE, and begins with 
e(7r) -E steps. 
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Figure 2: Illustrating the combinatorial proof of formula ©■ 



It remains only to show that each contribution to the Tutte polynomial of M[PE] arises 
as described above. Let tt' be a path that never goes above PE and consider the last N 
step that tt' has in common with PE; clearly the next step must be E. Let vr be the path 
obtained after removing this E step. Since vr' had no N steps in common with PE after 
the E step removed, the path vf does not go above P. Thus the path vr' can be obtained 
from vr by adding an E step after the i(vr')-th N step that vf has in common with P, and 
hence vr' arises from vr as in the previous paragraph. It is easy to show that vr' cannot be 
obtained in any other way by applying the procedure described above, and this finishes the 
proof. 
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3 Main result 

Let /c, / be fixed positive integers, and let P„ = [N^E^)^. Our goal is to count the number 
of lattice paths from (0,0) to {ln,kn) that never go above P^. From the considerations in 
the previous section, this is the same as computing t(M[P„]; 1, 1). Let 

An = Anix,y) = t{M[Pn];x,y). 

By convention, Pq is the empty path and Aq = 1. 

In order to simplify the notation we introduce the following operator $ on two- variable 
polynomials: 

^A{x,y) = -^A{x,y) + (y ^ ) A{l,y). 

X — 1 \ X — 1 J 

Then, by equations ^ and Q we have 

An+l = <^>^{x^An), 

where denotes the operator $ applied i times. 

For each n > and i = 1, ... ,1, we define polynomials B^ nix, y) and Ci^niy) as 

Bi,n = {x^An{x,y)'^ , 

We also set Co^n{y) = An{l,y). Notice that Bi^n = ^n+ij and Co,n(l) = An{l,l) is the 
quantity we wish to compute. 

Then, by the definition of we have: 

Bl,n = ^X^An +(y- ^] Co,n; 
B2,n = ^^l,n +[y-^) Cl,n; 



Bl,n = T^Bi^i^n + [y — Q-l.n,; 
An+l = Bi^n- 

In order to solve these equations, we introduce the following generating functions in the 
variable z (but recall the coefficients are polynomials in x and y): 

A = ^ ^ AnZ^ , Ci = ^ ^ Ci^nZ"' , i = 0, . . . , I. 

n>0 n>0 

We start from the last equation An+i = Bi^n and substitute repeatedly the value of Bi^n 
from the previous equation. Taking into account that X^^^n+i-^" = (^ ~ l)/-^) a simple 
computation yields 

A-1 , , ^ 
-. rjA + (yx — y — x) > -. 

{x-iy ^ ^ (^-1) 
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We now set y = 1 and obtain 

I 

A ({x - ly - = (x - 1)' - z ^ x'-\x - if-' Ci_i, (4) 



i=l 

where it is understood that from now on we have set y = 1 in the series A and Cj. 
By Puiseux's theorem (see [SJ Chap. 6]), the algebraic equation in w 

{w - ly - zw^+^ = (5) 

has k + I solutions in the field C^''^((z)) = {Yln>no o.n.z'"'^^} of fractional Laurent series. 
Proposition 6.1.8 in ^ tells us that exactly I of them are fractional power series (without 
negative powers of z); let them be wi{z),. . . ,wi{z). 

We substitute x = wj in Q) for j = 1, . . . , /, so that the left-hand side vanishes, and 
obtain a system of / linear equations in Co, Ci, . . . , C;_i, whose coefficients are expressions 
in the Wj, namely 

I 

^ w]~' {w, - ly-'zQ., = {w, - 1)^ j = i,...,i. (6) 

i=l 

Notice that, since the Ci are (ordinary) power series, the solutions w of ^ that we substi- 
tute in (jU cannot have negative powers of z, hence they must be wi, . . . ,wi. We remark 
the similarity of this technique with the one devised by Tutte for counting rooted planar 
maps (see, for instance, 

It remains only to solve © to obtain the desired series Cq = ^„ j4„(1, The 
system © can we written as 

Y^izTTj) zCi_,_i = Wj - 1, j = l,...,l. 

The left-hand sides of the previous equations can be viewed as the result of evaluating the 
polynomial Yli=Qi^^i-i-i)-^^ of degree / — 1 at X = Wj/{wj — 1), for j with 1 < j < I. Using 
Lagrange's interpolation formulas, we get that the coefficient of X^~^ in this polynomial is 



zco=y: 



Wj 



By straightforward manipulation this last expression is equal to 

- 17(1 - E = - 17(1 - w,), 

where the last equality follows from an identity on symmetric functions (set r = in 
Exercise 7.4 in 0). 

Thus we have proved the following result. 
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Theorem 1. Let k,l be positive integers. Let qn be the number of lattice paths from (0,0) 
to {In, kn) that never go above the path {N^E^)^ , and let wi, . . . ,wi be the unique solutions 
of the equation 

{w - ly - zw''^^ = 

that are fractional power series. Then the generating function Q{z) = X]n>o InZ^ is given 
by 

z 

Remark that, by symmetry, the number of paths not above (N^E^)"' must be the same 
as in Theorem ahhough the algebraic functions involved in the solution are roots of a 
different equation. 

In the particular case k = I the solution can be expressed directly in terms of the 
generating function G{z) = '^nnTiCn)'^^ Catalan numbers, which satisfies the 

quadratic equation G{z) = 1 + zG{z)'^. Indeed, © can be rewritten as 

w = 1 + z^/^uP', 

whose (fractional) power series solutions are G(C^zi/^), j = 0,...,fc- 1, where C IS a 
primitive k-ih root of unity. For instance, for k = I = 3 (corresponding to s = 6, t = 3 in 
the original problem), = exp(27ri/3) and we obtain the solution 

— (1 - G{z^^^)){l - G{Cz^/^)){l - G(C^zi/3)) = 
z 

1 + 20z + 662^2 + 267802^ + 1205961z^ + 58050204^^ + • • • . 

In the same way, if / divides k and we set p = {k + l)/l, the solution can be expressed in 
terms of the generating function (^p_i^n+i (^)'^" generalized Catalan numbers; the 
details are left to the reader. As an example, for = 4, / = 2, we obtain the series 

^(l-if(.V2))(l_^(_,l/2)) = 

1 + 15z + SeOz^ + 10463z^ + 337269/ + 11599668z^ + • • • , 
where H{z) = En aTTad") satisfies H{z) = 1 + zH{z)\ 

4 A further generalization 

In this section we solve a further generalization of the tennis ball problem. Given fixed 
positive integers si, ti, . . . , s^, with ti < Si for all i, let s = Yl^i^^ — Yl^i- There are 
sn labelled balls. In the first turn we do the following: balls 1, . . . , si go into the basket 
and ti of them are removed; then balls si + 1, . . . , si + S2 go into the basket and among 
the remaining ones t2 are removed; this goes on until we introduce balls s — Sr + 1, ■ ■ ■ , s, 
and remove tr balls. After n turns there are tn balls outside the basket and the question 
is again how many different sets of tn balls may we have at the end. 

The equivalent path counting problem is: given ki,li, . . . ,kr,lr positive integers with 
k = ^ki, I = Ylh, count the number of lattice paths from (0,0) to {ln,kn) that never 
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go above the path P„ = (N^^E^^ ■ ■ ■ N^^E^^)"'. The solution parallels the one presented in 
Sectional We keep the notations and let An = t{M[Pn];x,y), so that 

As before, we introduce I polynomials Bi^n{x, y) and Ci^niv) = Bi^n{^, y), but the definition 
here is a bit more involved: 

=¥ix>'^An), i = l,...,h; 

Bi,+i,n =¥{x''^Bi,^n), i = l,...,/2; 

Bi,+i,+,,n = $Ha^''^«i+«2,n), i = l,...,k; (7) 
Bl-l,+i,n = ^'{X^'-Bi^i^^n), i = l,...,lr. 

We also set Co^niv) = An{l, y). Again, from the definition of we obtain a set of equations 
involving A„, An+i = B^n, the Bi^n and Ci^„. We define generating functions A and Cj 
{i = 0, . . . ,1) as in Section |31 

Starting with A^+i = Bi^ni we substitute repeatedly the values of the from previous 
equations and set y = 1. After a simple computation we arrive at 

A ((x - 1)' - zx^+') = {x-iy + z U{x, Co, ... , (8) 

where ?7 is a polynomial in the variables x,Co, . . . ,Ci^i. Observe that the difference 
between (jHJ and equation (jlj is that now U is not a concrete expression but a certain 
polynomial that depends on the particular values of the ki and li. 

Let wi, . . . ,wi be again the power series solutions of ©• Substituting x = Wj in Q 
for j = 1, . . . ,/, we obtain a system of linear equations in the Cj. Since the coefficients 
are rational functions in the Wj, the solution consists also of rational functions; they are 
necessarily symmetric since the Wj, being conjugate roots of the same algebraic equation, 
are indistinguishable. 

Thus we have proved the following result. 

Theorem 2. Let ki,li, . . . ,kr,lr be positive integers, and let k = ^ki, I = '^li. Let 
Qn be the number of lattice paths from (0, 0) to {In, kn) that never go above the path 
[N^^E^^ ■ ■ ■ j^ir^n ^ let wi, . . . ,wi be the unique solutions of the equation 

{w - ly - zw''+^ = 

that are fractional power series. Then the generating function Q{z) = ^n>QQnZ^ is given 
by ^ 

Q{z) = -R{wi, . . .,wi), 

where R is a computable symmetric rational function of wi, . . . ,wi. 

As an example, let r = 2 and (ki, li,k2, h) = (2, 2, 1, 1), so that k = I = 3. Solving the 
corresponding linear system we obtain 

^ {I - Wi){l - W2){1 - Ws) 



2vJiW2Ws — {wiW2 + W1W3 + W2W3) ' 
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and 



Q{z) = ^R=1 + I6z + 503^2 + 19904^3 + 885500z^ + 42298944z^ 



It should be clear that for any values of the ki and /j the rational function R can be 
computed effectively. In fact, a simple computer program could be written that on input 

ki,li, . . . , kr,lr, outputs R. 

5 Concluding Remarks 

It is possible to obtain an expression for the generating function of the full Tutte polynomi- 
als An{x, y) defined in Section |SI We have to find the values of Co, Ci, . . . , C;_i satisfying 
the system @ and substitute back into @. After some algebraic manipulation, the final 
expression becomes 



n>0 



(^zx^+i - [x- iy){y - wi{y - I)) ■ ■ ■ {y - wi{y - 1)) ' 



Taking x = y = 1 we recover the formula stated in Theorem ^ 

On the other hand, references [Sj and also study a different question on the tennis 
ball problem, namely to compute the sum of the labels of the balls outside the basket for 
all possible configurations. For a given lattice path P„, this amounts to computing the sum 
of all elements in all bases of the matroid M[P„]. We remark that this quantity does not 
appear to be computable from the corresponding Tutte polynomials alone. 

Finally, as already mentioned, the technique of forcing an expression to vanish by 
substituting algebraic functions was introduced by Tutte in his landmark papers on the 
enumeration of planar maps. Thus the present paper draws in more than one way on the 
work of the late William Tutte. 
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